
10 research outputs found

Index structures for distributed text databases

Author: Marin Cahiuan Juan Mauricio
Publication venue
Publication date: 01/04/2004
Field of study

The Web has become a ubiquitous resource for distributed computing, making it relevant to investigate new ways of providing efficient access to services available at dedicated sites. Efficiency is an ever-increasing demand that can only be satisfied by developing parallel algorithms that are efficient in practice. This tutorial paper focuses on the design, analysis and implementation of parallel algorithms and data structures for widely used text database applications on the Web. In particular, we describe parallel algorithms for inverted files and suffix array structures that are suitable for implementing search engines. Algorithmic design is carried out on top of the BSP model of parallel computing. This model ensures portability across diverse parallel architectures, ranging from clusters to supercomputers. Facultad de Informátic

Index structures for distributed text databases

Author: Marin Cahiuan Juan Mauricio
Publication venue
Publication date: 01/04/2004
Field of study

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Servicio de Difusión de la Creación Intelectual

Index structures for distributed text databases

Author: Marin Cahiuan Juan Mauricio
Publication venue
Publication date: 10/08/2004
Field of study

Servicio de Difusión de la Creación Intelectual

In order to perform multimedia searches (over sounds, videos, images, etc.), we have to use data structures like the Spatial Approximation Tree (SAT). This structure is a good example of a tree in which well-known tricks for tree parallelization simply do not work: it is too sparse and unbalanced, and its performance depends too heavily on the workload generated by the queries that are solved by searching the tree. The complexity measure is the number of distances computed to retrieve the objects close enough to the query. In this paper we examine some alternatives for parallelizing this structure using the MPI library and the BSPpub library. Facultad de Informátic

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

A parallel search algorithm for the SAT

Author: Gil Costa Graciela Verónica
Marin Cahiuan Juan Mauricio
Printista Alicia Marcela
Reyes Nora Susana
Publication venue
Publication date: 26/05/2008
Field of study

Servicio de Difusión de la Creación Intelectual

Buckets inverted list for a search engine with BSP

Author: Gil Acosta Graciela Verónica
Marin Cahiuan Juan Mauricio
Printista Alicia Marcela
Publication venue: Universidad Nacional de La Plata. Facultad de Informática
Publication date: 01/04/2006
Field of study

Most information in science, engineering and business has been recorded in the form of text. This information can be found online on the World Wide Web. One of the major tools supporting information access is the search engine, which usually uses information retrieval techniques to rank Web pages based on a simple query and an index structure such as inverted lists. The retrieval models are the basis for the algorithms that score and rank the Web pages. The focus of this presentation is to show some inverted list alternatives, based on buckets, for an information retrieval system. The main interest is how query performance is affected by the index organization on a cluster of PCs. The server design is built on top of the Bulk Synchronous Parallel (BSP) model of parallel computing.
Affiliation: Printista, Alicia Marcela. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - San Luis; Argentina. Universidad Nacional de San Luis. Facultad de Ciencias Físico Matemáticas y Naturales. Departamento de Informática. Laboratorio Investigación y Desarrollo en Inteligencia Computacional; Argentina
Affiliation: Gil Acosta, Graciela Verónica. Universidad Nacional de San Luis. Facultad de Ciencias Físico Matemáticas y Naturales. Departamento de Informática. Laboratorio Investigación y Desarrollo en Inteligencia Computacional; Argentina
Affiliation: Marin Cahiuan, Juan Mauricio. Universidad de Magallanes; Chil

CONICET Digital